SemanticScuttle - klotz.me » klotz: machine learning+nlp+text

klotz: machine learning* + nlp* + text*

Bookmarks on this page are managed by an admin user.

Towards Generative AI for Model Architecture This bookmark is certified by an admin user.

With deep learning, the ROI for having clean and high quality data is immense, and this is realized in every phase of training. For context, the era right before BERT in the text classification world was one where you wanted an abundance of data, even at the expense of quality. It was more important to have representation via examples than for the examples to be perfect. This is because many Al systems did not use pre-trained embeddings (or they weren't any good, anyway) that could be leveraged by a model to apply practical generalizability. In 2018, BERT was a breakthrough for down-stream text tasks,

2023-11-11 Tags: deep learning, llm, generative, embeddings, bert by klotz

SemanticFinder - Frontend-only Semantic Search with transformers.js This bookmark is certified by an admin user.

2023-11-03 Tags: llm, semantic, search, embedding, cosine similarity, browser by klotz

Graph Learning for Exploratory Query Suggestions in an Instant Search System - Spotify Research : Spotify Research This bookmark is certified by an admin user.

2023-10-30 Tags: search, discovery, query suggestion, auto suggest, graph, node2vec, embedding, naive users, mounia lalmas, spotify, research by klotz

Mastering Customer Segmentation with LLM This bookmark is certified by an admin user.

Unlock advanced customer segmentation techniques using LLMs, and improve your clustering models with advanced techniques

2023-09-28 Tags: llm, segmentation, kmeans, k prototype, tsne, pca, sentence, embedding, clustering, exploratory data analysis by klotz

T5 text-to-text Transformers This bookmark is certified by an admin user.

2023-06-29 Tags: t5, text, nlp, encoder -decodet, bert, transformers by klotz

LLM vector search This bookmark is certified by an admin user.

2023-06-23 Tags: llm, vector, search, embedding, python by klotz

openai-cookbook/Zero-shot_classification_with_embeddings.ipynb at main · openai/openai-cookbook · GitHub This bookmark is certified by an admin user.

Zero-Shot Classification
To perform zero shot classification, we want to predict labels for our samples without any training. To do this, we can simply embed short descriptions of each label, such as positive and negative, and then compare the cosine distance between embeddings of samples and label descriptions.

As shown above, zero-shot classification with embeddings can lead to great results, especially when the labels are more descriptive than just simple words.

The highest similarity label to the sample input is the predicted label. We can also define a prediction score to be the difference between the cosine distance to the positive and to the negative label. This score can be used for plotting a precision-recall curve, which can be used to select a different tradeoff between precision and recall, by selecting a different threshold.

2023-05-31 Tags: openai, classification, embedding, machine learning, zero-shot by klotz

Dense vector embeddings This bookmark is certified by an admin user.

2022-12-24 Tags: embedding, bert, word2vec, deep learning by klotz

Word embeddings | Text | TensorFlow This bookmark is certified by an admin user.

2022-11-10 Tags: embedding, doc2vec, tensorflow, word2vec, classification, neural network by klotz

Part E: Text Classification with an Embedding Layer in a Feed-Forward Network - Deep Learning Tutorials with Keras - Medium This bookmark is certified by an admin user.

2022-11-10 Tags: classification, doc2vec, embedding, neural network by klotz

SemanticScuttle - klotz.me

klotz: machine learning* + nlp* + text*

Linked Tags

Related Tags